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We further develop the large N formalism presented by some of us in earlier works in order to 
recursively calculate the partition function of a singly pseudoknotted RNA. We demonstrate that 
this calculation takes time proportional to the sixth power of the length of the RNA. The algorithm 
itself is presented in a self-contained form for the convenience of readers interested in implementing 
it. 
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I. INTRODUCTION 

An RNA molecule is a heteropolymer, consisting of four 
types of nucleotides — uracil (U), adenine (A), guanine 
(G) and cytosine (C) — which are linked into a chain by 
phosphate bonds. The nucleotides then form saturating 
hydrogen bonds with one another, with A bonding to U 
and C bonding to G. These bonds cause the RNA to 
fold up into a stable, three dimensional structure. The 
structure is further stabilized by the stacking interac- 
tions, where runs of adjacent bonds twist into the familiar 
Watson- Crick helices. 

The folding process is believed to be roughly 
hierarchical^]^], in that the primary (one-dimensional) 
structure, defined as the sequence of nucleotides, de- 
termines the secondary (two-dimensional) structure. If 
one visualizes the RNA as a circle, tying the first nu- 
cleotide to the last, and then represents hydrogen bonds 
as arcs within the circle, the secondary structure is the 
structure that can be drawn without bonds crossing 
one another Q, The secondary structure is largely 
unaffected by the formation of the (three-dimensional) 
tertiary structure, which includes, among other things, 
pseudoknots. Pseudoknots are precisely those structures 
which do contain bonds that would cross each other in- 
side the disk. Algorithms that allow for the prediction 
of certain pseudoknots exist. For instance, @ allows for 
any number of bonds on the outside of the disk, as long 
as they do not cross. Our algorithm allows for a different 
set of topologies, enumerated in 

We only consider a basic model of RNA, in which we 
only account for the energies associated with bond for- 
mation and neglect stacking interactions, loop entropies, 
and steric constraints. Modern RNA folding algorithms 
tend to be more complex, especially towards secondary 
structure 0, |n|, but we opt for a simpler approach for 
the sake of clarity in this paper. As a result, the imple- 
mentation we outline here is merely an illustration; it is 
not useful for predicting the structure of folded RNA. 

Let —Uij be the change in energy of forming a hy- 



drogen bond between the ith and jth bases, and let 
Vij = exp(Uij /T) be the Boltzmann factor at temper- 
ature T . Then one can write the partition function as a 



sum of terms of the form Vi 1 i 2 Vi 3 i i 
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in the above sum must be unique because a base is satu- 
rated by a single hydrogen bond. Our partition function 
is then 



i<j i<j<k<l 



1-2 E v » v * 



N 2 



(1) 



i<j<k<l 



The factor of N is a parameter which tracks the topol- 
ogy of the bond structure. Since there is usually a 
small number of pseudoknots, and they can be con- 
trolled by the concentration of Mg ++ ions in solution, 
we have introduced the parameter N, and penalize terms 
involving pseudoknots with factors of l/N 2 . The sec- 
ondary structure is 0(1), while a restricted set of singly- 
pseudoknotted structures are O (N~ 2 )\j^. 

There are many (~ L\) possible terms in the above 
sum. We propose an algorithm which allows us to calcu- 
late this partition function to O (N~ 2 ) in O (i 6 ) time. 
We also generalize this algorithm to calculate the parti- 
tion function of an RNA that contains an arbitrary num- 
ber of these O (N~ 2 ^ pseudoknots. We can then use 
backtracking to determine what bonds contribute to the 
dominant term in the partition function, and thus the 
folded structure at low temperatures. We can also de- 
termine the probability that a given bond is present at a 
finite temperature. 

In order to express sums like P in a convenient fash- 
ion, we introduced two objects, which we call "propaga- 
tors" . The first of these is the two-indexed Gij , which 
is the partition function of the secondary structure of a 
chain of nucleotides identical to that which lies between 
the jth base and the ith base. We set Gi,i+i = 1 and 
Gis+i = for I > 1, which forbids "backwards propaga- 
tion" , insuring Gy always runs in the 3' to 5' direction. 
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The other propagator is the four- indexed Ay^;, which 
is non-zero if i > k > I > j. In that case, Ay.^ is the 
partition function of the structure of two spatially sepa- 
rated, anti-parallel, chains of nucleotides, one between k 
and i, and the other between j and I, . 

These definitions allow us to depict topological classes 
of structures with Feynman diagrams. In these diagrams, 
Gij is an arrow pointing from j to i, and A, 3; tj is a box 
with solid sides with corners at i, j, k, and I. Individual 
Vij bonds are dashed lines from i to j. These diagrams 
are typically in one-to-one correspondence with sums, as 
in 0. The diagrams, as well as typi cal as socia ted st ruc- 



tures, are shown in fig. ^ I n n 8' s - 1(b) and |l(dj| the 
heavy lines represent the RNA backbone, and the empty 
circles represent nucleotides. 



(a) Diagram 
for Gia. 




The procedure for calculating the partition function 
remains a recursive one, though. One finds the partition 
function Zl+i,i of an L + 1 nucleotide RNA in terms of 
the structures of the L nucleotide RNA. This is simple 
enough for the secondary structure, because it is defined 
to exclude crossings. Thus, one considers only structures 
where a single bond arcs over complete secondary struc- 
tures. This reasoning will lead one to the defining relation 
for G described in section ITTT| and, indeed, it is how the 
equation was first derived[3j. 

However, crossings are the defining feature of pseudo- 
knots, so our recursion relation will need to handle them. 
This requires the introduction of the "vertex operators" 
mentioned in section VI of pj. These vertex operators 
are defined to be equal the partition functions of the sec- 
ondary and pseudoknot structures on runs of nucleotides, 
with the additional requirement that a given nucleotide 
is unsaturated, so an incoming hydrogen bond can at- 
tach there. For example, the structure between the bases 
marked i and j in fig. [21 can be associated with a term 
in a vertex operator that allows an incoming bond from 
the L + 1st base to connect at k. 



(b) Typical structure Gij. 



< k 
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FIG. 2: The bond from L + 1 to k is allowed when considering 
pseudoknots. 



(c) Diagram (d) Typical 

for Dij.^i. structure 
Dij-ki- 

FIG. 1: G and A. 

We begin by summarizing the results relevant to a com- 
puter implementation of the algorithm are in section ITT1 
We present two variations on the algorithm. One allows 
for the calculation of the partition function with a single 
pseudoknot. The other allows for any number of pseudo- 
knots. The remainder of the paper presents derivations 
of these results for the interested reader. 

The index structure makes the topology of RNA fold- 
ing similar to that of large N quantum chromodynamics, 
and these diagrams can be expressed using the t'Hooft 
double line formalism 0, 0- We review the derivations 
of the recursive equations which define A and G, as well 
some other techniques and notations from Q and 0, in 
section ITTT1 

Calculating the pseudoknot partition function is con- 
siderably more involved than backtracking the secondary 
structure partition function. Instead of a single, simple 
sum, many topologies must be summed over, and many 
of those involve summations over several indices. In order 
to do these sums efficiently, we define new propagators, 
similar to G and A, in section llVl 



Once these vertex operators are put in a form similar 
to that of the pseudoknots in [2j, which can be summed 
explicitly, it is possible to use them to write a recursion 
relation that can be used to generate a partition function 
that includes pseudoknots. Fortunately, the techniques 
needed to evaluate these vertex operators are very sim- 
ilar to those used to evaluate the pseudoknots. These 
calculations are described in section [V] 



II. ALGORITHM 

Using the methods and results of the previous section, 
one can calculate the partition function of a singly pseu- 
doknotted RNA with L nucleotides as follows: 

1. Generate the matrix from the RNA sequence. 

2. Solve the Hartree equation 

i-1 

Gij = Gi-ij + VikGi-i.k+iGk-ij + 

k=j 

for G. 

3. Use the result for G and the Bethe-Salpeter equa- 
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tion 



i-1 I 



m—k n—j + 1 

to find D. 

4. Use D and G to determine E, K, and L. 

I i 

Eij-ki — y ^ Dij. nm G n ~i y kGi, m +i 

m—j n—k 
I i 

Kij-ki = y y D n j- t k m Gi im -i-iGi, n +i 

m—j n— k 
I i 

Lij^kl ^ ^ ^i7n-nlG n —i j kG m — ij 
m—j n— k 

5. Calculate the vertex Tj-, denned by 



^ij — Eij;J+i,J-i 

Then calculate the 8^- , defined by all the diagrams 
in fig. ^3 The explicit sums associated with the 
terms in 0^ are contained in appendix 1X1 

6. Use the recursion relation 
Zli — Zl-i,i 

L-l 

+ ^ Vlj (Zl-i,j+iGj-i,i + Gl-\,j+\Zl-\.\) 



j=i 



L-l 



Jr ^ ^ VLjGL-l,m+l^mnGn-l,l 

l 

L-l 

y^ VLjGL-l,m+lQ'mnGn-lS 



J=l mn 

L-l 



J=l mn 

to find the partition function Zli- 

Alternatively, one can find the partition function for 
an RNA with an arbitrary number of pseudoknots by 
replacing the recursion relation in step 5 with (|39[1 . 

Zli = Zl-i,i 

L-l 

+ y^ Vlj^L-1,J+1^J-1,1 



L-l 



+ y] X] VLjGi-i^+ir^Gn-i,! 
J=l mn 
L-l 

+ y^ y^ Vi,j(jL-l,m+l6^ n Gn-l,l 



We invite the interested reader to examine the deriva- 
tion of the algorithm in the following sections. The algo- 
rithm can pick out knots. An example is shown in fig. El 
where the (one-pseudoknot) algorithm was applied to the 
sequence AGUC. The arcs represent hydrogen bonds. 



A G U C 
FIG. 3: Sample knot. 



III. MATRIX FORMALISM AND STEEPEST 
DESCENT 

In Q , two of the authors developed a formalism involving 
integrating over sets of matrices in order to exploit known 
results from quantum field theory. In this section, we 
review the techniques and notations associated with this 
formalism that we use in this article. 

The analysis began with a demonstration that Q is 
equivalent to 



7 1 9 

ZL1 -Ndh 



h=0 



dAt 



-4±tr A 



(det M(A)f (2) 



The integral is taken over the space of all (L + 1) x (L+l) 
hermitian matrices A. M(A) is a matrix function of A 



M ij( A ) = 8ij — 8i,j+i +hS i id L+ i j j +i^Vi-ij Ai-xj (3) 
In matrix form, this reads 



/ 1 o 

-1 l + a 12 



M(A) 





ai3 



'12 



-1 l + a 23 



h\ 

aiL 
a 2 L 



V a lL a 2L 



<-l,L "I 1 



(4) 



Here, we have abbreviated the notation so that 



for i < j 
for j > i 



(5) 



Note that we use to refer to the square root of the 
(«, j)th element of V. 

We use the well-known identity det M — e tr log M and 
evaluate the derivative in (0) to get 



Zli 



dAe-^ A2+m * log m ( a )m~ 1 (A)l+i,i (6) 



We can define an expectation value for the function 0(A) 
relative to an action S(A) = itr A 2 — tr logM(A) as 



J—l mn 



(O) = / dAe- NS ^0(A) 



(7) 



4 



This allows us to write the partition function in a partic- 
ularly compact form. 



Z L i = (M 



L+1,1 



(8) 



We can use the steepest descent approach outlined in 
section V of in order to calculate the quantities in 
brackets to O (N~ 2 ) . We expand (JSJ) in terms of a matrix 
fluctuation x, which is defined by 



(9) 



Here, is stationary point of the action S(A). We 
define 



Gi 



M- l {A) i+1 j 



(10) 



From this, it can be shown that the propagator Gij sat- 
isfies a Hartree recursion relation. 



i-l 



Gi-l,j + ^ VikGi-lJe+lGk—l,j + (11) 



k=j 



This can be solved after we impose boundary conditions. 
This equation is familiar from the RNA folding literature, 



J 



and was first used in |3( to determine RNA secondary 
structure. This is to be expected, as G y - is the secondary 
structure partition function for a run of nucleotides be- 
tween j and i. Fig. 0] shows the defining relation for the 
Hartree propagator. 



(a) 



FIG. 4: Hartree equation for Gij 



Next, we find corrections to Gij by expanding S(A + 
x/V~N) as a power series in x. The first term in the 
action is just a quadratic, 



1/t x V 1 751 1 -r 1 9 

-tr [A-\ r =-tvA I ^ r trAaH tree 2 

2 V Ni 2 Mi 2N 



Using the definitions in and (|10|l . we can write 



My [ A 



tr log M A 



N~- 



kl 

tr lo 



gM(A) + tr log ^1 + ■j=Y,G i - 1 , k V$_ ld x k -i d 



If we define Rij = J2 k Gi-\,kV k —\ j%k-i,j, we can expand 



, , , R\ R R 2 

tr log [I + 1— = tr % — — + — 

Ni) \ Ni N 



Terms linear in x vanish because A is the stationary point 
of S(A), so our expectation value becomes 



(O) =C dx exp 



(tr x 2 - tr R 2 



E 



txR p 



O (12) 



with the normalization C absorbing the Jacobian from 
the change of variables and various other constants (like 

)■ 



We reintroduce the shorthand of [l( and Q for powers 
and traces of powers of R in order to facilitate discussion 
of these expectation values. 



Bp 



R" 
tr B n 



Now we can write 



Ni 



(13) 
(14) 



Gk-1,3 (15) 



J ik 



using the power series for (1 + x) . 

We call the free (i.e., quadratic) part of the action 



S {x) 



-trx z 



-tr R 2 



(16) 
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and use it to define the free expectation value, 

(O) = / dxe So{x) (17) 



SO that Vrr, 



if either I > m > k or I > n > k. 



The free action can be written in terms of a kernel A 1 , 
defined by 

i,j,k,l 

-1 11 

A ij-,ki = 6u6jk-Gi-i, i+ iGj-i,k+iVgVM 

Multiplying the four-indexed A -1 by its inverse gives the 
Bethe-Salpeter relation 



Ajj-jfcj — SikSij 
i-l i 

^ij Z_^i L_^i Gi-l^ m +lVm n A mn -klG n -ij+i (18) 



m—k n—j+1 



It is convenient to multiply this equation by and 
i 

Vui and define a slightly different propagator 

D ij]kl = ^JA^yj 
I? obeys a new Bethe-Salpeter relation. 



A, 



ij;kl 



i-l Z 

+ Gi-i.m+iDmn-kiGn-i.j + i 

m—k n—j+1 



(19) 



The diagrammatic form of this equation is shown in 
fig. II I II We can use the D propagator to evaluate ex- 
pectation values by Wick contracting xs, or we can do a 

i 

coordinate transformation in our integral £E.y = Uij/V^ 
and use D to contract ys, since the Jacobian associated 
with this change of variables is just a constant which can 
be absorbed into the normalization. 



s l 



FIG. 5: Bethe-Salpeter equation for D 



ij ; kl 



Solving (|19|l initially appears to be a task that will 
take O (L 6 ) time. However, the diagrammatic form of 
the equation indicates that Dij-ki can be written as 



Di+ij-i-k-ij+i — Vi+ij-i Vk-i,i+\G 



ij\kl 



(20) 



where the four-index G propagator Gij-ki is calculated in 
the same manner as Gij , except the matrix V is modified 



Calculating Gij-ki for a fixed k and / is then a O (L 3 ) 
task which much be performed O (£ 2 ) times, meaning 
that D can be calculated in O (L 5 ) time. 

Because Gy represents the partition function of the 
possible secondary structures on the nucleotides between 
i and j inclusive, we should never see anything involv- 
ing G that looks like a conventional matrix product, 
(e.g, GijDjk-mn), because that would include un- 
physical structures which have two hydrogen bonds end- 
ing at j. However, we see many terms of the form 
5^ • GijDj-i k-mm and so it is convenient, for an arbi- 
trary matrix Q, to define a non-standard matrix product 
for sums involving G, 



(QG)ij — QikGk-lJ 
k 

(GQ)ij = ^ Gj-± t kQkj 



Using these definitions we can write a more explicit 
form of Z 



Z L1 = (exp 



p=3 



E 

9=0 



G 



L+1,1/ 



(21) 



where we have used the series expansion for M 1 ( A + 
x/N?). This expression can then be expanded to 
O (N~ 2 ) , and the various terms can be evaluated using 
Wick's theorem. 



IV. PROPAGATORS 

The steepest descent expansion of <|21|) was completed in 
0, and lead to a number of sums, each of which corre- 
sponds to a possible pseudoknot topology. While these 
sums need not be evaluated in order to calculate the par- 
tition function, many similar sums must be. As discussed 
in section ^ each of these contributions will be expressed 
as sums involving D and G propagators, with indices run- 
ning over appropriate ranges. In order to perform these 
summations efficiently, and describe them compactly, we 
must introduce new propagators. These are indexed ob- 
jects similar to the already defined D and G, and they 
also fall into two classes, those which have two indices, 
and those which have four. 

The first new two-indexed propagator is Zij itself, and 
the second, Z mn , is the pseudoknot contribution — that 
is, the O (N~ 2 ) part of Z mn . The different two-indexed 
propagators are related by the following equations, 



Z^ — G^ + ^ 2 Zij 



(22) 



FIG. 6: Diagrams defining Z and Z. 



Now, one can see from the Bethe-Salpeter equation for 
Dij-ki that it always has bonds both between the ith and 
jth bases and between fcth and Zth bases. It is often 
convenient (and efficient) to make different assumptions 
about which bonds are implicitly defined when imple- 
menting a dynamic programming algorithm on a com- 
puter. To this end, we define new four-indexed propaga- 
tors, the first of which is, 



Eij-ki — 



y , y , Dij-p q Gp 

V=k q=j 



l,kGl,q+l 



(23) 





We draw Eij-ki in diagrams exactly like Dij-ki, except 
that we draw it with a dotted edge between k and I to 
represent the fact that it doesn't contain an explicit hy- 
drogen bond between those two nucleotides. The physical 
picture of the propagator D^-ki presented in — i.e., two 
spatially separated runs of nucleotides (one from j to /, 
the other from k to i) that form nucleotide bonds with 
each other — is equally applicable to E. However, instead 
of the two strands being bound to each other at both 
ends, here the inner ends of each strand may be loose. 



FIG. 8: Diagrams for K and L. 



FIG. 7: Defining relation for E- 



It's also useful to define two other four-indexed prop- 
agators, K and L. Both K^-m and Lij-ki have explicit 
bonds connecting i and j. However, Kij-^i is defined to 
have a bond connecting to k, but not necessarily to Z. 
Likewise, Lij-^i will always have a bond connecting at Z, 
but may not have a bond connecting to k. 

I i 

G i>n+1 (24) 

m—j n—k 
I i 

Lij-kl = y ' y ' Dim;nlGn-l,kG m -ij (25) 
m—j n—k 

Diagrams for K and L have a dotted edge drawn to the 
nucleotide which doesn't have an explicit bond connect- 
ing to it, and solid edges drawn to the nucleotides that 
do attach to bonds. 



Propagator 


Explicitly bonded bases 


Dij-ki 


i, j, k, and I 


Eij-ki 


i and j 


Kij-ki 


k and j 


Lij;kl 


I and i 



TABLE I: Four-indexed propagators 



V. RECURSION RELATION AND VERTICES 

In 0, two independent methods were presented that al- 
lowed one to calculate a partition function from (J2J, the 
integral over A. The first method was steepest descent, 
reviewed in section IIIII and which was carried to com- 
pletion in 0- The second method, involving recursively 
integrating out rows and columns of A, was presented in 
section VI of Q. 



Zli — Zl-i,i 

L-l 



j=i 



L-l 



J=l 

.-1 

Y, V "( M L?j + l( X ) M Jl( X ) 



L-l 



(26) 
(27) 

(28) 

(29) 



j=i 



This method gives a recursion relation (to O (N- 2 )) for 
Z which depended on objects, called "vertices" , which 
were not calculated explicitly. Here, we will use the steep- 
est descent in order to find a form for the vertices that 
can be summed directly by a computer. It turns out that, 
in order to find a usable recursion relation, both methods 
must be used in concert. 
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In order to use (|27() to find the O (-/V -2 ) partition func- 
tion, vertex operators for the insertion of bonds which 
cross other bonds are required. We can find these vertices 
by performing a steepest descent calculation on the recur- 
sion relation presented in section VI of and then do- 
ing the fluctuation integral. This procedure will produce 
a modest proliferation of terms which can be interpreted 
with Feynman diagrams and then summed explicitly. 

The first of these corrections, (I27II . 

L-l 

J=l 

is easy to evaluate because 

{Mj£ ld (x)) =Z kj (30) 
I 




FIG. 9: Diagram of 1^ 



We need to truncate the product at O (JV 2 ) . The result 
can be written in terms of the Z propagator introduced 
in 

{ M Zj+l) ( M Jl) = ZL-LJ+lZj-!,! 

— GL-l,J+lGj-i t i + Gl-1,. 7+1^,7-1,1 

+ Z L _ 1 . J+1 Gj- 1A +0(N- A ) (31) 

Next we calculate (|28[) by doing the fluctuation inte- 
gral /m^xMJ j + x) by Wick contraction, as detailed in 
appendix El This yields 



— GL-ij+iTjjGj—i^ (32) 

I 

Here, we have introduced the amputated vertex operator 
r, which can be defined very simply in terms of E. 

Fy = ^y;J+l,J-l (33) 
The remaining term in the recursion relation is 



2_, G L-i,i+iG j-i^Dij-kiGk-ij+iG j -ij+i — 



M 



L,J+1 IVI J1 



= { exp 



E 

p=3 



H) p T P 
pNP/ 2 " 1 



\q=0 



Nil 2 



G 



L.J+l 



\r=Q 



{-i) r B r 
N r / 2 



G 



71i 



o,c 



This expands out to 



are contained in appendix 1X1 



— ^(y3 1 G)(73 3 G) + (B 2 G)(B 2 G) + (B 3 G)(BiG) 
+ ^ [{B X G)(B 2 G) + (B 2 G)(B 1 G)} 

^ + ^](B 1 G)(B 1 G)^ (34) 

We suppres the indices on these terms because they are 
all the same: (B p G)L,j+i{B q G)j-i^. These can be eval- 
uated the same way that the terms in l|32|l were, by Wick 
contracting the powers of x implicit in the Ts and Bs. 
This is detailed in appendix [5] Diagrams representing 
each term are shown in fig. I1UI and explicit summations 



These are modifications of diagrams calculated in , 
with an unsaturated nucleotide than can accept a hydro- 
gen bond at J. Not all diagrams in [2( are modified this 
way, nor are all insertion points included, because doing 
so would produce pseudoknots of O (tV -4 ) . We have fur- 
ther simplified notation by suppressing the Gs and angle 
brackets. It is also convenient to discuss "amputated" 
vertices, that is, the sum of all the terms in (|34() and 
w ith the external Gl-1,% and propagators 
removed. This amputation insures that there are bonds 
attaching to the subscript indices of both 0^- and . 
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Zlx = Zl-i,i (35) 

L-l 

+ ^ Vlj (Zl-ij+iGj-i^i + Gl-i,j+iZl-i,i) 
j=i 

(36) 

1 

+ JT ^ ^ VLjG L -X, m +l^m n G n -\,\ (37) 
J—l mn 

L-l 

+ ^2 ^2 VljG L-i tm +l®mrfin-l,l (38) 
J—l mn 

Using the definitions for the vertex operators Tfj and 

Qfj, as well as the relation ^Ej > we can recursively cal- 
culate the partition function for an RNA that contains 
a single pseudoknot. However, we can easily extend this 
equation so that it consistently accounts for RNAs that 
have any number of pseudoknots. This is done by "filling 
in" the G propagators in (|35[) terms of the relation, to 
turn them into Zs with no penalty to efficiency. 

Z L1 = (39) 

L-l 

+ J2 V L jZ L _ h j +1 Zj- ltl (40) 
j=i 

1 

+ Jf ^ ^ VLjGL-l,m+l^m n G n -l,l (41) 
J—l mn 

L-l 

,m+l 

G„-i,i (42) 

J—l mn 

VI. CONCLUSION 

It may be apparent that the execution time of the algo- 
rithm presented here grows as L 6 , because the limiting 
step is the calculation of the various vertices. Though 
a few of them actually require more than the three in- 
dices and three summations implied by that order, those 
sums can be restricted in a manner similar to that used 
to calculate the four-indexed propagators. 

The model for bonding we use is somewhat simplistic, 
in that we account only for independent bonds. How- 
ever, it should not be difficult to incorporate stacking 
interactions, or loop penalties, or other more advanced 
scoring methods into the algorithm, while retaining the 
topologies and recursion relation obtained from the large 
TV analysis. 




(g) BiBi Tf 
FIG. 10: Distinct terms in 9^ 




L m n 1 



FIG. 11: Diagram form of recursion. 
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APPENDIX A: EXPLICIT VERTICES 



diagrammatic form in fig. 1101 we also present the explicit 
summations in this appendix, to simplify efforts to im- 



The 0(N- 2 ) vertex term, 6/- is, as shown in sectionEJ plement the algorithm. 



a sum of several terms. While these terms are shown in 



,7-2 ,7-1 

(-61-63)0=^ ^ J5i,J+l;J+l,fc-l-K'j-l,j;fe! 
l=j k=l+l 
,7-2 i 

l=j k=J+2 
i—1 i 

{B 3 Bi} = 22 L^j + x-kiEk-i,j;i+i,j-i 
l=,J+l k=l+l 

J-3 ,7-2 J-l i 

(BiB2Ts) = ^ ^i,ra+l;k,m— l-Efc-l,J+l;J+l,n-Kj-l,i;mJ 

l=j n=l+l m=n+l fc=,7+l 
,7-2 i-2 i-1 i 

(-B2-Bi^3)o = y] Lij + i^, n -iEk-ij+i-m,j-iE m -ij-„i 

l—j rc— 7+2 m— n+1 fc=m+l 
,7-3 ,7-2 i-1 i 
(BlBiT 4 ) — £i,7+l;fc,m-l-E'fc-l,Z+l;mn-?0-l 

i=j n=l+lm=J+2/c=m+l 

J-4 ,7-3 J-2 i-2 i-1 i 

(-B1-B1T3) = X! ^y+i^^-i^-in+i™? 

^— j ra=Z+l p— n+1 o— ,7+2 m— o+l fc— m+1 
x ^ro-l,i+l;on-*0-l,j;p+l,l 



(Al) 
(A2) 
(A3) 
(A4) 
(A5) 
(A6) 
(A7) 



APPENDIX B: EVALUATION OF INTEGRALS 



We calculate (|28|l by evaluating the integral 



lir-l \ 



JJ+l/ c 



p=3 



2-1 ' 



Vg=0 



Ll 



7v r / 2 

vr=0 



1T3 T 4 Ti 



3A7l 47V 18W 



1 lBl B2 \( 

1 ¥ |( ' 



Ll 



i-i^-^iG 

at? iv 



(Bl) 



(B2) 



7,7+1 



0,C 



We have only expanded to (V x ) because of the ex- by symmetry, since they depend on odd powers of x, so 
plicit factor of 1 /N in • The fiV"*") terms vanish this sim P lifies to 
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M£ 1 1 M7j +1 



Gl\G 



1 /T 4 T| 



AT V 4 18 



3N 



^ {(B 2 G) L1 Gj,j +1 + G L1 (B 2 G)jj +1 } - ^(B 1 G) L1 (i? 1 G) J , J+1 \ (B3) 

/ o,c 



The 0(1) term (Gl~i,iGj-i.j + i) vanishes be- 
cause it depends on the "backwards propagator" 
Gj-u+i. Four OiN- 1 ) terms, (T 4 G L _ ltl Gj_ ltJ+1 ), 
(TiG L ^ 1A Gj~ 1J+1 ), (T 3 (B 1 G) L1 Gj- 1J+1 ) and 
((B 2 G)liGj-ij + i), vanish for the same reason. 
This leaves us with only three terms to compute, 



r 



(T 3 G L - M ( J B 1 G) J , J+1 ), {G L - X<1 {B 2 G)j, J+l ) and 
((BiG)li{B\G)j,j + i) . Th e first tw o are shown in 
diagram form in figs. 13(a) and 13(b) respectively; they 
obviously vanish. The third is shown as a diagram in 
fig. Inland can be written as an explicit sum, 



((BiG) li{B\G) — ^ GL-i,i+iGj-isDij.kiGk-ij+\Gj~\j + i — G^-i^+ii^j; j_iGj_i,i (B4) 

I 



The left hand side depends on L while the right hand side of matrix products that we introduced in section InTl 
depends on L — 1 because of the non-standard definition The same integration technique applies to (|34|l . 



M 



L,J+1 M J1 



= 1- 



T 4 



T 2 



iBj_ Eh iB 3 

Ari/2 _/v iV 3 / 2 



1 - 



G 



iV 1 / 2 



B2 

N 



iB 3 

AT3/2 



G 



(B5) 



Jii o,c 



r 



We drop some terms in the expansion because they can- 
not yield non- vanishing connected expectation values. 
The remaining terms can be expanded to O (A^ ~ 2 ) . The 
fractional coefficients cancel over-counting in terms that Wick contraction. The contractions which remain are 

I 



depend on traces, which are symmetric under cyclic per- 
mutations of the ccs. 

As an example, we evaluate the first of the terms by 



((B2G)l,j+i{B2G)j i i) 0C — GL-i,i+iGk-i,rn+iG -i,,j+iGj-i, p +iG n -ij+iGj-i t i(Di n] kpD m j ]0 i + Di 0] k m Dpj ]n i) 

(B6) 



GL,i+iGj^ 14 Eu-k+i^j-iEkj-, j +1,1-1 



The first of the tw o Wick contractions is shown as a 
diagram in fig. 12(a) while the second term, shown in fig. 
12(b)| vanishes. The non-vanishing part can be greatly 



simplified with the use of the appropriate propagators. 
The advantage of the propagators introduced in section 
IIVI should now be obvious; we have reduced the number 
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(b) Vanishing contraction. FIG. 13: Vanishing contractions in T 

FIG. 12: Contractions in (B 2 B 2 ) 
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